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(54) System and method for measuring the operation of a device. 

(57) A system and method for measuring the oper- 
ation of a device is disclosed. A plurality of 
virtual sensors which define a multi-dimen- 
sional virtual sensor space sense the operation 
of the device. A normal operation determiner 
determines at least one region of normal oper- 
ation for the device. An abnormal condition 
determiner provides a warning when the oper- 
ation does not fall within the normal operation 
region which is defined within the multi- 
dimensional virtual sensor space. A mainte- 
nance method and system are also disclosed for 
maintaining at least one device. The mainte- 
nance system includes a measurement system 
and an abnormality processor. The measure- 
ment system measures the operation of the 
device and provides an indication of when the 
device is not operating normally. The abnor- 
mality processor predicts, based on the output 
of the measurement system, when the device is 
likely to fail. 



CO 

m 

CO 

8 



Q. 

LU 




Jouve. 18. rue Saint-Denis. 75001 PARIS 



EP 0 632 353 A1 

FIELD OF THE INVENTION 

The present invention relates to measurement and maintenance systems generally and to computerized 
measurement and maintenance systems in particular. 

5 

BACKGROUND OF THE INVENTION 

Businesses which have equipment therein typically have a maintenance schedule for maintaining the 
equipment. For some businesses, the maintenance schedule is virtually non-existent; equipment is maintained 

10 or fixed only when it fails in some way. When a critical machine fails in such a factory, the entire production 
process may come to a stop. 

Some businesses solve this problem by performing preventive maintenance wherein they replace parts 
of machines at certain, predetermined intervals, whether or not the parts are close to failing. This avoids many 
failures. However, perfectly good parts are often unnecessarily replaced. 

15 Other businesses have computerized their maintenance systems by using sensors, connected to comput- 

ers, to measure the operation of their machines. The computers typically alert the maintenance personnel when 
the sensors detect a failure of some part of the machine. 

Although the computerized systems detect where failures occur, easing the job of repairing the failure, the 
computerized systems do not generally provide a lot of warning that a failure is about to occur. 

20 Device monitoring systems, using single or multiple sensors, are known, some of which are described in 

the article, "Empirical Models for Intelligent Data Validation" produced by Halliburton NUS Environmental Cor- 
poration of Idaho Falls, Idaho, USA The systems described in the article use reference data, collected when 
the device being monitored was operating as desired, to define a model of "normal device behavior". Later out- 
put data, as a result of later input signals, are compared to model output produced from the same input signals 

25 and an alert is provided if the model output does not match the real output data. 

SUMMARY OF THE PRESENT INVENTION 

Applicants have realized that a maintenance system ideally should indicate when a failure mode (i.e. the 

30 changes in the device which occur prior to actual failure) is developing. 

It is therefore an object of the present invention to provide a measurement system and method which pro- 
duce an indication of abnormal operation. When applied to a maintenance system, the measurement system 
and method indicate the development of a failure mode. When applied to a control system, the measurement 
system and method indicate movement out of an allowed region of operation. 

35 There is therefore provided, in accordance with a preferred embodiment of the present invention, a meas- 

urement system for measuring the operation of a device. The system includes a plurality of virtual sensors de- 
fining a multidimensional virtual sensor space, a normal operation determiner for determining at least one re- 
gion of normal operation for the device, and an abnormal condition determiner for providing a warning when 
the operation does not fall within the normal operation region. The region is defined within the multidimensional 

40 virtual sensor space. 

Additionally, in accordance with a preferred embodiment of the present invention, the region can be defined 
as at least one quantized region around a datapoint within the multidimensional virtual sensor space. The da- 
tapoint describes one state of a normally operating device. 

Alternatively, in accordance with a preferred embodiment of the present invention, the system includes 

45 an operation space def iner for defining a reduced dimension operation space from the multi-dimensional vir- 
tual sensor space. Datapoints of the virtual sensor space describing one state of a normally operating device 
are stored. The region is defined as a collection of distances of the datapoints from the axes of the operation 
space. 

Moreover, in accordance with a preferred embodiment of the present invention, the virtual sensors corn- 
so prise physical sensors and computed sensors, wherein the computed sensors are sensors whose output is a 
function of output of at least one physical sensor. 

There is also provided, in accordance with a preferred embodiment of the present invention, a maintenance 
system for maintaining at least one device. The maintenance system includes a measurement system for meas- 
uring the operation of the device and for providing an indication of when the device is not operating normally 
55 and an abnormality processor for predicting, based on output of the measurement system, when the device is 
likely to fail. The measurement system is as described hereinabove. 

Additionally, in accordance with a preferred embodiment of the present invention, the abnormality proc- 
essor has stored therein a list of failure paths each having a multiplicity of nodes and determines if the abnormal 
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condition indication falls within the nodes of the failure paths. 

Moreover, in accordance with a preferred embodiment of the present invention, the abnormality processor 
additionally predicts when the device will fail based on which node on the failure path the indication fell and 
on how long it took the device to reach the node. 

5 Furthermore, in accordance with a preferred embodiment of the present invention, the abnormality proc- 

essor rebuilds the normal operation region when the indication is falsely produced. 

There is also provided, in accordance with a preferred embodiment of the present invention, a method of 
measuring the operation of a device. The method includes the steps of a) defining a multi-dimensional virtual 
sensor space from output data of a plurality of virtual sensors, b) determining at least one region of normal 

10 operation for the device, and c) providing a warning when the operation does not fall within the normal operation 
region. The normal operation region is defined within the multi-dimensional virtual sensor space. 

Additionally, in accordance with a preferred embodiment of the present invention, the step of determining 
includes the step of defining a reduced dimension operation space from the multi-dimensional virtual sensor 
space. The reduced dimension operation space has fewer axes than the multi-dimensional virtual sensor 

15 space. 

Further, in accordance with a preferred embodiment of the present invention, the step of determining ad- 
ditionally includes the step of defining, from a plurality of datapoints within the multi-dimensional virtual sensor 
space each describing one state of a normally operating device, the region as a collection of distances of the 
datapoints from the axes of the reduced dimension operation space. 

20 There is further provided, in accordance with a preferred embodiment of the present invention, a method 
for maintaining at least one device. The method includes the steps of a) measuring the operation of the device 
and providing an indication of when the device is not operating normally and b) predicting, based on output of 
the step of measuring, when the device is likely to fail. The step of measuring is similar to the method of meas- 
uring described hereinabove. 

25 Additionally, in accordance with a preferred embodiment of the present invention, the step of predicting 
includes the steps of a) providing a list of failure paths where each path has a multiplicity of nodes; and b) 
determining if the abnormality indication falls within the nodes of the failure paths. 

Moreover, in accordance with a preferred embodiment of the present invention, the step of predicting ad- 
ditionally includes the step of producing a prediction of when said device will fail based on which node on the 

30 failure path the abnormality indication fell and on how long it took the device to reach the node. 

Further, in accordance with a preferred embodiment of the present invention, the step of predicting addi- 
tionally includes the step of rebuilding the normal operation region when the abnormality indication is falsely 
produced. 

There is still further provided, in accordance with a preferred embodiment of the present invention, a meas- 
35 urement system including a plurality of virtual sensors, a normal operation determiner and an abnormal con- 
dition determiner. The ranges of output from the plurality of virtual sensors define a multi-dimensional virtual 
sensor space. The normal operation determiner, which is coupled to the plurality of virtual sensors at least dur- 
ing installation, defines at least one multidimensional normal operation region from at least one datapoint within 
the multi-dimensional virtual sensor space. The datapoint is defined from output of the plurality of virtual sen- 
40 sors received at least during the installation. The abnormal condition determiner, which is coupled to the plur- 
ality of virtual sensors during a period of regular operation, provides an abnormality indication when output of 
the plurality of virtual sensors during the period of regular operation does not fall within the at least one normal 
operation region. 

Finally, there is provided, in accordance with a preferred embodiment of the present invention, a mainte- 
45 nance system including a region and path storage unit, a measurement system and an abnormality processor. 
Te region and path storage unit stores (i) at least one multi-dimensional normal operation region defined by 
at least one multi-dimensional state which an operator determines as indicating normal operation of the device 
and (ii) a list of failure paths, each path ending in a different failure and each path having a multiplicity of multi- 
dimensional nodes. Each multi-dimensional node is defined by a multidimensional abnormal operation region 
50 defined by at least one multi-dimensional state defined by the operator as indicating abnormal operation of 
the device. 

The measurement system includes measurement apparatus and comparison apparatus. The measure- 
ment means measures the operation of the device and the comparison means compares output of the meas- 
urement apparatus with the normal operation region and provides an abnormality indication when the output 
55 of the measurement apparatus does not fall within at least one of the normal operation regions. The abnormality 
processor determines when the abnormality indication falls within at least one of the multi-dimensional nodes 
of the failure paths and predicts therefrom when the device is likely to fail. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention will be understood and appreciated more fully from the following detailed description 
taken in conjunction with the drawings in which: 
5 Fig. 1 is a block diagram illustration of a measurement system, constructed and operative in accordance 

with the present invention; 

Fig. 2A is a graphical illustration of a quantized two dimensional sensor space, useful in understanding a 
first embodiment of operation of operation measurement units of the system of Fig. 1; 
Fig. 2B is a graphical illustration of a two dimensional sensor space and a principal component indicating 
10 the interrelation between the two sensors, useful in understanding a second embodiment of operation 

measurement units of the system of Fig. 1; 

Fig. 2C is a graphical illustration of a plurality of regions of normal operation, useful in understanding a 
second embodiment of the operation of the system of Fig. 1 ; 

Fig. 3 is a block diagram illustration of elements of an operation measurement unit forming part of the sys- 
15 tern of Fig. 1; 

Fig. 4 is a flow chart illustration of an abnormality processor forming part of the system of Fig. 1, when 
the system of Fig. 1 is implemented in a maintenance system; 

Fig. 5 is a schematic illustration of a two-dimensional, conceptual normal operation region and a plurality 
of failure paths extending from it, useful in understanding the operations illustrated in Fig. 4; and 
20 Fig. 6 is a graphical illustration of position datapoints useful in understanding the prediction operation util- 

izing the failure paths of Fig. 5. 

DETAILED DESCRIPTION OF A PREFERRED EMBODIMENT 

25 Reference is now made to Fig. 1 which illustrates the measurement system of the present invention. As 

mentioned hereinabove, the measurement system can be implemented in a plurality of types of systems, in- 
cluding maintenance and control systems. 

The measurement system typically comprises a plurality of operation measurement units 10, each moni- 
toring the operation of one device 12 and an installer 13 for installing the measurement units 10 on its corre- 
30 sponding relevant device 12. For maintenance systems, the system of the present invention additionally com- 
prises an abnormality processor 14 for determining if the device 12 is about to fail. The device 12 can be any 
piece of equipment, such as a fan, compressor, pump, rotating device, boiler, etc. 

Connected to each measurement unit 1 0 are a multiplicity of sensors 1 5 for sensing the activity of the ap- 
propriate device 12. The sensor data is processed by the measurement unit 10, in accordance with the present 
35 invention, and, if desired, the results are provided to the abnormality processor 14. 

The type of sensors 1 5 utilized varies according to the device 12 to be measured. For example, if device 
12 is a pump, then its corresponding sensors 15 are pressure sensors measuring the input and output pres- 
sures, an input power sensor, and a current sensor sensing the input current. 

The measurement units 10 typically provide information regarding the operating condition of the device 
40 12. The information typically comprises warnings received whenever the device 1 2 develops an abnormal con- 
dition and alarms, similar to those provided by systems of the prior art, received whenever the device 12 has 
reached a failure condition. 

The output of a single sensor 15 does not completely indicate the entire operation of the device 12, nor 
does it reflect the significant changes which occur as a failure approaches. Each sensor provides the symp- 
45 toms, as it were, of the device, but not the complete operation of the device. To view the complete operation, 
one must view the sensor data as a whole. For the purposes of the present discussion, the "sensor data" is 
the output of the sensors 15 and of computed sensors whose output is computed from the output of one or 
more sensors 15. The total number of "virtual sensors" (i.e. sensors and computed sensors) is M. 

In accordance with the present invention, the measurement system defines the normal mode of operation 
so as a region or regions within an M-dimensional space 20 (Figs. 2A and 2B) of sensor output The regions are 
formed from at least some of the sensor datapoints 24 produced during a plurality N of predetermined, instal- 
lation operations. 

Applicants present herein two embodiments, illustrated in Figs. 2A- 2C to which reference is now made, 
for defining the normal mode of operation, it being understood that other embodiments which define the normal 
55 mode of operation within an M-dimensional space are also included within the scope of the present invention. 

The first embodiment is illustrated two-dimensionally in Fig. 2A. The axes of Fig. 2A are the output of two 
sensors X, and X 2 . Each axis is quantized to a desired quantization level, where, in Fig. 2A, the quantization 
is noted by hatch marks 22. 
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The quantization breaks the space 20 into a plurality of M-dimensional "cubes" 23 in some of which, labeled 
25, are located a datapoint 24 from the installation operations. In accordance with this embodiment, the cubes 
25 which have datapoints therein are defined as the regions of the normal operation condition. 

If a datapoint 24 produced during regular operation falls within one of the regions 25, then it indicates that 
5 the device 12 is operating normally. Otherwise, device 12 is operating abnormally and the measurement unit 
10 produces a warning, called herein an "abnormal condition". 

It is noted that the datapoints 24 are often redundant (i.e. the M virtual sensors provide less than M pieces 
of information). Therefore, in accordance with a second embodiment of the present invention, illustrated for 
two-dimensions in Figs. 2B and 2C, the operation of the device 1 2 is extracted from the output of the M virtual 
10 sensors and is used to determine a single region of normal operation. 

To do this, a reduced operation space of K separate, uncorrected (i.e. orthogonal) bits of information is 
first determined. The datapoints 24 are then defined within the operation space, producing thereby a "normal 
operation region" (NOR) within the reduced operation space. All future sensor data are transformed to the op- 
eration space and compared to the NOR. 
15 The operation space is determined in accordance with the principles of Principal Component Analysis 
(PCA), described in the article by MacGregor, J.F. et al. in the article, "Multivariate Statistical Methods in Proc- 
ess Analysis and Control", Fourth International Conference on Chemical Process Control, South Padre Island, 
Texas, 1991. The calculations for a two sensor system, are illustrated in Figs. 2B and 2C. 

PCA involves computing a minimum number K of orthogonal basis vectors P) from the M-dimensional sen- 
20 sor space, wherein the basis vectors capture the operation of the device 12. Fig. 2B shows a plane of the output 
of two sensors and X 2 and a basis vector Pi which indicates the operation of the device 12. 

The NOR 26 is defined by determining, for each datapoint 24, the distance D from it to its projection T, on 
each basis vector Pj. It is noted that the distances D are functions of the projections T| and only the largest 
value of D(Tj) for each value of T t is stored. The resultant collection of distances D(T,) describes the normal 
25 operating condition of the device 12 within the K-dimensional basis vector space. The NOR 26 of the device 
12 is the region defined by D(T|), the axes T, and the observed limits of T, abstracted from virtual sensor data. 
If desired, the collection of distances D(T|) can be modeled by an R order polynomial. 

It will be appreciated that the distances D(T|) represent the locations of the measured datapoints 24, but 
in a more compact and useable format. Thus, NOR 26 can also be defined as the region bounded by those 
30 datapoints 24 selected to be part of D(T,). 

During regular operation, each measurement unit 10 calculates distances D(T,) for each datapoint received 
and determines whether or not the results fall within the NOR 26. If not, a warning is provided. 

If, after a warning has been received, it is determined that the abnormal datapoint in fact, is not abnormal 
(i.e. it is an allowed sensor value which was not measured during the initial installation operations), the basis 
35 vectors and the NOR 26 must be recomputed with the new datapoint. 

Reference is now made back to Fig. 1 . For maintenance systems, the measurement units 1 0 communicate 
with the abnormality processor 14. The communication can occur through a direct connection to the abnor- 
mality processor 14 or, if the abnormality processor 14 is located a significant distance from the measurement 
units 10, each measurement unit 10 can have a transmitter 16 which transmits data to a receiver 18 located 
40 on the abnormality processor 14. Alternatively, the abnormality processor 14 and measurement unit 10 can 
be formed into a single unit placed onto or near the device 12 to be monitored. 

The abnormality processor 14 sorts the abnormal conditions received from the measurement units 1 0 into 
three categories, (1) part of an already known failure path where each path ends in a different type of failure, 
(2) to be added to the NOR 24, or (3) an unknown failure probably due to some external, non-maintenance 
45 related, phenomenon. 

If the abnormal condition is part of a known failure path, the processor 14 typically provides an indication 
of the type of expected failure and a prediction of time to failure. Thus, a factory using the maintenance system 
of the present invention only has to repair those devices 12 which are likely to fail within a short period of time, 
and, the repair operation can be scheduled so that it does not interfere with the activity of the factory. Similar 
so devices 12 in the factory which are still operating "normally" are not taken out of production during the repair 
operation. 

Since the abnormality processor 14 receives abnormal condition information from a wide range of devices 
12, often located within a plurality of factories, the failure prediction is generally more accurate than that pro- 
duced based only on the abnormal conditions and failures of the devices 12 in a single factory. 
55 The installer 1 3 stores data regarding each type of device 1 2 monitorable by the system. The data includes: 

(1) a list of sensors 15 required for the device 12; 

(2) instructions as to how to connect the sensors 15 to the device 12; 

(3) the definitions of the computed sensors; and 
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(4) a list of installation operations to be performed in order to define the NOR 26. 

The installer 1 3 also determines the NOR 26 from the datapoi nts 24 of the installation operations. It is noted 
that the installation operations can be performed one right after each other. Alternatively, the installer 13 can 
collect data only when the device 12, as part of its regular operations, performs one of the installation opera- 
5 tions. 

Reference is now made to Fig. 3 which illustrates the elements of one measurement unit 10. Each unit 10 
typically comprises an input processor 30, a sensor signal processor (SSP) 32, a storage unit 33 for storing 
the "normal operating condition" (NOC) comprised of the basis vectors and the NOR 26 and an abnormality 
determiner 34 for determining if the current operation is within NOR 26. 

10 The input processor 30 typically performs initial processing of the signals from the sensors 15. Processor 

30 typically comprises a filter 35 for improving the signal-to-noise ratio of the signals from the sensors 15 and 
an analog-to-digital (A/D) converter 36 to convert the typically analog signals to digital data. The A/D converter 
36 typically samples the signals at a high sampling rate, such as 10KHz. 

The input processor 30 additionally comprises a filter 37 for computing the stationary parameters of the 

15 slower signals and a fast Fourier transformation unit (FFT) 38, operating on the faster signals, information re- 
garding the energy of the data over the frequency band. The FFT unit 38 can produce the amplitude values 
for selected frequencies, such as multiples of the speed of revolution of the device, or it can produce values 
for the energy within selected frequency bands. For later calculations, the output of the FFT unit 38 for each 
frequency or frequency band is considered a separate virtual sensor. 

20 Filter 37 can be any appropriate filter, such as a Butterworth low pass filter having a cutoff frequency of 

mhz. For maintenance systems, FFT unit 38 typically operates once every 20 seconds on a window of approx- 
imately 1 second long. Its output is provided to the SSP 32. 

The SSP 32 typically normalizes the sensor data so that all signals are defined in the same non-dimen- 
sional units. Furthermore, the SSP 32 typically converts the sensor output to a virtual sensor space, where, 

25 as mentioned hereinabove, each virtual sensor is either the same as one of the sensors 15 or a computed 
version of the output of one or more sensors 15. The output of the computed sensors indicate known, but un- 
measurable, relationships among the real sensors 15. 

For example, a computed sensor might be one of the amplitudes or energy values produced by the FFT 
unit 38. In another example, a computed sensor might be the pressure head H developed by a pump in response 

30 to the power P supplied to it. The pressure head H and power P are related by the function: H = A + B*P - 
C*P 2 , where the values for A, B and C are specific to each device 12 being monitored. Each relationship may 
have a tolerance associated therewith. 

The abnormality determiner 34 determines whether or not each virtual sensor datapoint, received from 
the SSP 32, falls within the region or regions of the NOR 26. If not, it provides a warning, indicated on Fig. 3 

35 as an "abnormal condition". 

For the first embodiment, the determination involves determining that the new virtual sensor datapoint is 
quantized to one of the regions 25 of the NOR 26. For the second embodiment, abnormality determiner 34 
calculates the distance D of the new virtual sensor datapoint from the basis vectors P t and determines whether 
or not the distance D is smaller, for the same value of T if than the value of D(T,) which forms part of the NOR 

40 26. 

In either case, when an abnormal condition exists, the abnormality determiner 34 provides on output the 
datapoints of the virtual sensors and the location of each datapoint within the M-dimensional space. For the 
second embodiment, the distance D(T|) is also provided. 

The operations of the abnormality determiner 34 are described in more detail hereinbelow. 
45 Reference is now made to Figs. 4 and 5. Fig. 4 illustrates, in flow chart format, the operations of the ab- 

normality processor 14. Fig. 5 illustrates a conceptual generic NOR, labeled 26', and a plurality of failure paths 
50 extending from it, useful in understanding the operation of the abnormality processor 14. 

The axes in Fig. 5 are D and two projections and T 2 and the NOR 26' is shown as a collection of cylinders 
DO^T^. The NOR 26' is typically a generic NOR 26, constructed from "good" NORs of a plurality of similar 
so devices. The data from the "good" NORs are scaled, standardized and merged in the process of constructing 
NOR 26'. 

Each failure path 50 ends in a type of failure, such as a broken axle, a serious leak in a pipe, etc. The path 
itself represents the abnormal conditions the device 1 2 passes through as the condition which eventually caus- 
es the failure develops. The final, failure condition, labeled 54, is a single point or region. 
55 In Fig. 5, the abnormal conditions are represented as K+1 -dimensional spheroids 52. The failure paths 50 

are represented mathematically as fuzzy sets and the spheroids 52 are represented as nodes. Each spheroid 
52 is expressed as a membership function, with its center having an association membership of 1 (100% likely 
that the center is the current failure state), decreasing to 0 as a function of the distance from the center. 
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The failure paths 50 can be determined in any of a number of ways. Experiments can be performed in 
which the device is caused to fail in a desired way within a short period of time. At given intervals, the abnormal 
conditions produced by the measurement unit 10 are stored, as is the final condition. The experiment is re- 
peated a number of times, thereby to produce a plurality of abnormal conditions at each interval of time. Each 
5 spheroid 52 is then a fuzzy node representing the range of abnormal conditions received at one interval of 
time. 

Alternatively, a failure path can be determined in the course of operating with the measurement system 
of the present invention. Every time an unknown abnormal condition occurs, it and the expected failure are 
converted to the generic NOR 26' and saved. Once a failure path 50 has been fully determined for a given 

10 failure, the trajectory defined by the abnormal conditions on the path is divided into a small number of interval 
points, wherein each point defines the center of a spheroid 52. Each abnormal condition on the path is then 
associated with a spheroid 52 in accordance with fuzzy logic. 

If desired, an operator-estimated time-to-failure can be requested for each abnormal condition on the fail- 
ure path 50. The operator-estimated time-to-failure of each spheroid 52 is the average of the estimated times- 

15 to-failure of the abnormal conditions which comprise the spheroid 52. The operator-estimated times-to-failure 
can be utilized for failure prediction. 

The generic failure paths 50 are formed of ordered lists of the nodes (spheroids 52). To be useful, the failure 
paths 50 are transformed into the specific PCA axes P ( for each device 12. The transformed failure paths, for 
each device, are stored in a list 56 and are utilized by the abnormality processor 14, as described hereinbelow. 

20 When a new abnormal condition is received from one of the measurement units 10, it is first tested, in 
step 60, against each known failure path 50 in the list 56. This involves computing the membership function 
for each node as a function of the new abnormal condition. The node with the highest association membership 
value is defined as the node to which the new abnormal condition belongs. 

For the three-dimensional space shown in Fig. 5, the membership function M is defined in equation (1) 

25 below, where the location of the new abnormal condition is (T 1f T 2 .D), the center of the node is (T^.^D'J) and 
the size of the node is defined by a tolerance defined as (T^.T^.D 5 ). 

-(J,-T*,f -(J 2 -T» t f -(D-D') 2 
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The device 12 which is developing a failure, the failure being developed and an estimated time-to-failure 
35 are typically provided to the maintenance personnel responsible for the device 12 so that they can repair the 
device, as necessary. 

The estimated time-to-failure can be determined in any number of ways. It can simply be the operator- 
estimated time-to-failure associated with the node. Alternatively, the time-to-failure can be estimated based 
on the speed with which the device 12 passes along a failure path 50. 

40 Once the device 12 has produced abnormal conditions belonging to three neighboring nodes 52 on a failure 

path 50, the velocity and acceleration of the device 12 along the failure path 50, as functions of the projections 
T| or the distance D, can be determined using backward difference formulae. Equations of motion can then be 
utilized to determine the time to failure, since the location of the final failure point or region is known. The spe- 
cific equations are provided hereinbelow beginning with Equation 14. 

45 If the new abnormal condition is not on a failure path, in step 64 it is sorted by an expert into one of three 

cases, part of a possible new failure path, an abnormal condition which should be part of the NOR 26, or some 
unknown failure which probably is due to an external, non-maintenance related cause, such as lack of gas in 
a compressor, lack of power, etc. In the cases of a new failure path or an externally caused failure, the system 
of the present invention apprises the maintenance personnel of the problem and the maintenance personnel, 

50 after defining the problem based on his experience, indicate the source of the abnormal condition and based 
on that information, the abnormal condition is sorted. 

The expert can be anyone who understands how the devices 12 which are being monitored operate. Since 
the expert sorts abnormal conditions from many types of devices 12 and many devices 12 of one type, he 
should be able to identify failure paths for one type of device based on failure paths seen on other types of 

55 devices. 

If the new abnormal condition is a false alarm and, in reality, indicates normal operation, the NOC (i.e. the 
basis vectors P| and the NOR 26) must be recomputed (step 66) and then stored in the storage unit 33 of the 
relevant measurement unit 10. The recomputation can be performed either by the abnormality processor 14 
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or by the installer 13. 

Other methods of processing abnormality and predicting failures are possible and are included within the 
scope of the present invention. 

The installer 1 3 is utilized for all types of measurement systems and is operative to set up the measurement 
5 unit with the definitions of the computed sensors and to create the initial NOC. 

Initially, the installer 13 inputs the computed sensor definitions and defines the NOR 26 as having no 
points. The device 12 then performs the installation operations, either at once or over time. The measurement 
unit 10 attached to the device 12 operates and, since the NOR 26 is 0, produces abnormal conditions for the 
entirety of datapoints received from the device 12. 
10 An expert, such as the one described hereinabove for the abnormality processor 14, sorts the abnormal 
conditions thus produced. In most cases, he will define each of them as being part of the normal operating 
condition of the device 12. 

Once abnormal conditions from all of the installation operations have been received and sorted, the basis 
vectors P, and the NOR 26 are generated and then stored in storage unit 33 of measurement unit 10. 
15 It is envisioned that, after many thousands of one type of device 12 have been operated with the meas- 

urement system of the present invention, a few, selectable normal operating conditions will be fully described. 
The selected normal operating condition will then be stored in storage unit 33 without the need to perform the 
installation operations and receive their results. 

It is noted that, if a sensor fails, the basis vectors P, and NOR 26 have to be recomputed. In this situation, 
20 the original N observations are utilized but for only M-1 sensors. The recomputation can be performed either 
by installer 13 or by abnormality processor 14. 

The normal operation condition is calculated by the abnormality processor 14 and the installer 13 as fol- 
lows: 

The N data points for each of the M sensors is collected into M raw data vectors, V. The data is then stan- 
25 dardized: 

v » = S^ = 1 " M - i = 1 " N (5) 

and stored in a (N x M) matrix X. The matrix X is then copied into a matrix Y of the same dimensions. A matrix 
X_hat holding the current representation of the data as modeled by the Principal Component Analysis is ini- 
30 tialized to 0. 

A(MxM) matrix A is formed according to: 

A = Y*Y (6) 

The largest eigenvalue and its corresponding eigenvector, which is the principal component, P k , are calculated. 
Since A is symmetrical, the tridiagonal form of A is calculated by Householder reduction and the first iteration 
35 of the QL algorithm is used to yield P k . The details of the QL algorithm are described on pages 350 - 363 of 
the book Numerical Recipes by Press, W.H. et al, Cambridge University Press, 1986. 

The vector of projections t* of the N datapoints on the k'th principal component P k are computed: 

tk = XP k (7) 

The model X_hat is recomputed: 
40 XJiat = XJiat + yy (8) 

The average squared prediction error SPE, defining how well the model matches the datapoints, is calcu- 
lated: 

SPE = (1/MN) * (sum(X,j - XJiatgft (9) 
where the sum is for i from 1 to N and j from 1 to M. If the value of SPE is less than a predetermined value, 
45 then the k principal components already computed are sufficient to describe the N datapoints. If not, the in- 
formation in the working matrix Y which is modeled by the k'th principal component is removed, as follows: 

Y = Y - yv (10) 

The variable k is increased by 1 (i.e. k = k+1) and the above steps are repeated, beginning from equation 7. 
Once all the principal components have been determined, the resultant PCA model is: 
so XJiat = tiPiT+ t 2 P 2 T + ... + yv (11) 

The distances D and scores T have to be determined, as follows: 

D, = sum(X {j - XJiaty) 2 , j = 1 .. N (12) 
Ty = XPj,j = 1..k,i = 1..N (13) 
Each point D| is a function of the scores Ty along each principal component 
55 It will be appreciated that the normal operating regibh is the collection of values D(T ( ). 

Generating an equation of motion. The current velocity and acceleration vectors can be estimated using 
past data using backward difference formulae. For example, using the three data points shown in Fig. 6, dis- 
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crete approximations for % and £ , particular components of the velocity vector ■^[r 1i r 2l D] T and of the accel- 

at 

eration vector [7^ , T 2 ,D] T , can be derived in terms of point wise information of the component x of the vector 
[7-,OT 

^AVAT/ AT* m ■ 1 AT? 1) + *»- "» (U) 

^ AW^A^ -^- * (15) 

Note that if Ar 1 =Ar 2 =Ar these equations simplify to: 

MB- MbgbHAJ (16) 

t w jaa^tifejmtSLJ (17) 

A7 2 

Using these approximations, the velocity and acceleration for changes in 7 1f 7 2 and D can be estimated. 
For each of these, an equation of motion can be formulated and solved in order to describe the motion along 
the failure trajectory in (T r T T D) space. Since the current position along the failure branch and the failure point 
are known, the equation of motion can be used to predict the time of intercept with the failure point. For a given 

component (i.e. one of 7 1t 7 2 or D) with local velocity and acceleration estimated as above, %(t f ) and £ (t f ) , we 
can write the distance covered after time t| which will be given by the equation: 



(18) 



Solving for time-of-failure. If the distance to the failure level of the component is X, then the time to failure 
can be derived from equation (18), namely: 



T " U) ^ 0.5t< Cl > 

(19) 

If, for a given component, the discriminant (the term in the square root) is negative, which could occur if 
the acceleration is negative and sufficiently large, then it is possible that the failure level may not be reached 
by that component If the acceleration term is positive, the failure level will always be reached at some time 
and equation (19) will give a non-zero answer 

Xf2U) ^ o.siU,) 

(20) 



Either of the two methods proposed will predict the intercept with the known failure point (or line, or group). 
This is TfcW , which was recorded at time t=tj. 

The reported failure estimate at time t=t< should be quantified by confidence limits. Since we would expect 
that failure time estimates will become more accurate with time, it makes sense to use a moving buffer of es- 
timated failure times in order to compute the standard deviation statistic. Thus we define the size of this buffer, 
n b . The i'th average estimated time of failure x^/) can be computed: 
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T tt (i)»-22 

i2h 



(21) 



It is not the recommended measure to use, since it will be strongly influenced by early estimates, which 
10 are in general less reliable. However, it can be used to compute the standard deviation of the failure time es- 
timate: 



£ <t r2 (i-j)-T r2 (i)) 2 

J=2 

(n b -l) 

(22) 



The standard deviation can be used to delineate the estimated time of failure. As in the lower level algo- 
rithm, the time-oMailure estimate can then be stated as: 

^/) = XA2(«±3.a^) (23) 

25 where the estimate is bounded by 95% confidence limits. Clearly, it is important to establish rules which will 
govern when to publicize the estimate. One set recommendations can summarized as follows: 

"If the uncertainty margins are more than 30% of the estimate itself (i.e. the standard deviation is more 
than 10% of the estimate), do not publicize the estimate". 

It will be appreciated by persons skilled in the art that the present invention is not limited to what has been 
30 particularly shown and described hereinabove. Rather the scope of the present invention is defined only by 
the claims which follow: 
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Claims 

35 

1 . A measurement system for measuring the operation of a device, the measurement system comprising: 

a plurality of virtual sensors defining a multidimensional virtual sensor space; 
a normal operation determiner for determining at least one region of normal operation for said de- 
vice, said at least one region being defined within said multi-dimensional virtual sensor space; and 
40 an abnormal condition determiner for providing a warning when said operation does not fall within 

said at least one normal operation region. 

2. A measurement system according to claim 1 and wherein said at least one region is at least one quantized 
region around a datapoint within said multi-dimensional virtual sensor space wherein said datapoint de- 

45 scribes one state of a normally operating device. 

3. A measurement system according to claim 1 and additionally comprising an operation space def iner for 
defining a reduced dimension operation space from said multi-dimensional virtual sensor space, wherein 
said reduced dimension operation space has fewer axes than said multi-dimensional virtual sensor space. 



50 



55 



A measurement system according to claim 1 and additionally comprising a region def iner for def ining, from 
a plurality of datapoints within said multi-dimensional virtual sensor space each describing one state of 
a normally operating device, said at least one region as a collection of distances of said datapoints from 
said axes of said operation space. 

A measurement system according to claim 1 and wherein said virtual sensors comprise physical sensors 
and computed sensors, wherein said computed sensors are sensors whose output is a function of output 
of at least one physical sensor. 
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6. A maintenance system for maintaining at least one device, the system comprising: 

a measurement system for measuring the operation of said device and for providing an indication 
of when said device is not operating normally; and 

an abnormality processor for predicting, based on output of said measurement system, when said 
5 device is likely to fail. 

7. A maintenance system according to claim 6 and wherein said measurement system comprises: 

a plurality of virtual sensors defining a multidimensional virtual sensor space; 
a normal operation determiner for determining at least one region of normal operation for said de- 
10 vice, said at least one region being defined within said multi-dimensional virtual sensor space; and 

an abnormal condition determiner for providing a warning when said operation does not fall within 
said at least one normal operation region. 
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8. A maintenance system according to claim 7 and wherein said abnormality processor comprises a list of 
failure paths each having a multiplicity of nodes and a processor for determining if said indication falls 
within said nodes of said failure paths. 

9. A maintenance system according to claim 8 and wherein said abnormality processor additionally com- 
prises a predictor for predicting when said device will fail based on which node on said failure path said 
indication fell and on how long it took said device to reach said node. 

1 0. A maintenance system according to claim 7 and wherein said abnormality processor comprises a normal 
operation region rebuilder operative to rebuild said normal operation region when said indication is falsely 
produced. 

25 11. A method of measuring the operation of a device, the method comprising the steps of: 

defining a multi-dimensional virtual sensor space from output data of a plurality of virtual sensors; 
determining at least one region of normal operation for said device, said at least one region being 
defined within said multi-dimensional virtual sensor space; and 

providing a warning when said operation does not fall within said at least one normal operation re- 
gion. 
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12. A method for maintaining at least one device, the system comprising the steps of: 

measuring the operation of said device and providing an indication of when said device is not op- 
erating normally; and 

35 predicting, based on output of said step of measuring, when said device is likely to fail. 
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